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Abstract 

The logistic map is one of the simple systems exhibiting order to chaos tran- 
sition. In this work we have investigated the possibility of using the logistic 
map in the chaotic regime (logmap) for a pseudo random number generator. 
To this end we have performed certain statistical tests on the series of num- 
bers obtained from the logmap. We find that the logmap passes these tests 
satisfactorily and therefore it possesses many properties required of a pseudo 
random number generator. 
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1 Introduction 



A sequence of numbers that are chosen at random are useful in many different kinds of 
apphcations hke simulation, sampling, numerical analysis, decision making, recreation 
etc. A sequence of truly random numbers is unpredictable and hence irreproducible. 
Such a sequence can only be generated by a physical process, for example, radioactive 
decay, thermal noise in electronic devices, cosmic ray arrival time etc. In practice, 
however, it is very difficult to construct physical generators which are fast enough 
and at the same time accurate and unbiased. Furthermore, one would like to repeat 
the calculation at will, for debugging or developing the program. Thus for most cal- 
culational purposes, pseudo random numbers (prn) have been introduced. Pseudo 
random numbers are numbers computed from a deterministic algorithm (hence it is 
called pseudo random or quasi-random) and therefore reproducible. Obviously these 
are not at all random in the mathematical sense, but are supposed to be indistin- 
guishable from a sequence generated truly randomly. A good PRN generator should 
possess long period, high speed and randomness. 

Over the years various PRN generators have been developed and can be broadly clas- 
sified into the following categories [0: those are, i.) Linear recurrence methods, ii.) 
Multiplicative congruential generators, iii.) Tausworthe generators and iv.) Combi- 
nation generators. All these are bit based generators. These methods have various 
parameters or inputs and the period and the statistical properties of PRN sequences 
sensitively depend on these parameters. The first two are known to have periods of 
about ~ 10^ on a 32— bit machine. In the other two methods one can achieve much 
larger period (~ 10^'''°) 0. 

It is not easy to invent a foolproof source of random numbers. Generally a number 
of tests are performed to test the randomness of the numbers generated by prn's. 
In spite of these tests, one finds that a PRN that has passed these tests may fail 
when applied to some physical applications: For example, in a recent study Ferren- 
berg et al. have shown that even the high quality prns are biased under certain 
circumstances. Extensive Monte Carlo simulations by this group on an Ising model, 
for which exact answers are known, have shown that ostensibly high quality random 
number generators may lead to subtle, but dramatic, systematic errors for some algo- 
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rithms, but not others. They traced the discrepancy to the correlations in the random 
numbers. Another recent study by Vattulainen et al. find no such correlations. 
This only means that what is "random" enough for one application may not be ran- 
dom enough for another. The important criterion is that a specific algorithm must be 
tested together with the random number generator being used regardless of the tests 
which the generator has passed. This being the scenario, it may be useful to consider 
prn's based on algorithms different from conventional algorithms. 

In the present work we discuss a PRN generator based on an inherently chaotic (ran- 
dom) algorithm and its statistical properties. Here we have employed the logistic map 
in the chaotic regime as a PRN generator. We feel that such an effort is useful because 
it provides an entirely different method of producing prn's. Also the algorithm used 
is very simple, so the generator is quite fast. There have been earlier attempts [§] to 
use the logistic map as a random number generator. For example, the logistic map 
in the chaotic regime was used to calculate the properties of a well-known random 
process; the invasion percolation problem. In this study, it was found that the static 
properties of percolating clusters, except for the percolation threshold, are correctly 
calculated but the dynamical properties are not. It is not clear from this work if this 
difference is due to non-uniform nature of the distribution or due to the correlation 
between successive numbers. The method we are adopting here differs from the earlier 
calculations in two respects. In the earlier works the random numbers were drawn 
from the logistic map which is known to have a distribution of ~ \Jx{l — x) and used 
in the simulation studies. We use a simple transformation to convert this distribution 
into a uniform distribution. Also, the earlier calculations make no attempt to either 
remove or study the effects of the correlations that exist between succesive numbers 
generated by the logistic map. We have done that. Here we draw the numbers from a 
uniform distribution (see below) and investigate the statistical properties. Our calcu- 
lations show that the period of such a generator is certainly larger than 10^ (although 
theoritically infinite) if the computations are done in double precision. Our investi- 
gation of the distribution properties of this PRN generator shows some peculiarities. 
These can be cured by introducing the r-shift (to be explained below). In the follow- 
ing we shall first describe the logistic map and discuss the PRN generator in Section 
2. The results of different tests performed on prn's thus generated are presented in 
Section 3 and the conclusions are given in Section 4. 
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2 The Logistic Map 



Chaos in dynamical systems has been investigated over a long period of time 0. With 
the advent of fast computers the numerical investigations on chaos have increased 
considerably over the last two decades and by now, a lot is known about chaotic 
systems. One of the simplest and most transparent system exhibiting order to chaos 
transition is the logistic map 0. The logistic map is a discrete dynamical system 
defined by 0, 

Xi+i = fiXi{l - Xi) (2.1) 

with < Xj < 1. Thus, given an initial value (seed) xq, the series Xi is computed. 
Here the subscript i plays the role of discrete time. The behavior of the series as a 
function of the parameter /i is interesting. A thorough investigation of logistic map 
has already been done [7]. Here, without going into detailed discussion, we simply 
note that, 

• Eq. (|2.1|) has a; = and x = {fi — as fixed points. That is, if Xj = or 
{fi — 1)/ /i, then Xj+i = Xi. 

• For /i < 1, X = is an attractive (stable) fixed point. That is, for any value of 
the seed xq between and 1, Xi approaches exponentially. 

• For 1 < yU < 3, X = (/i — l)//i is an attractive fixed point. 

• For 3 < yU < 4, the logistic map shows interesting behavior such as repeated 
period doubhng, appearance of odd periods etc. 

• For /i = 4 the logistic map is chaotic. 

Since the chaotic behavior of the logistic map is of interest to us, we shall discuss the 
last point in detail. If we choose two seeds xq and yo = xo + Sx, with 6x arbitrarily 
small, Xi and yi differ by finite amount for large enough i. For example, if 6x = 10~^^, 
Xi — Vi ^ 0.1, for i ~ 40. In fact, Xj — yi grows exponentially with i and this is the 
definition of chaos. An analytic solution of logistic map exists for /x = 4. If we choose 
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Xj = (1 — cos(6'j))/2 and Xj+i = (1 — cos(26'j))/2 the logistic map for yU = 4 can then 
be defined by 

f 2ei, for < 7r/2 

^m= ' (2.2) 

[ 271 - 29i, for 9i > n/2. 

Clearly, the map in terms of 6i is given by stretching the line of length vr to 2tt and 
folding it. An examination Eq. (|2.2|) shows that one gets periodic series if the seed 6o 
is a rational fraction of it. On the other hand, the series does not have periodicity 
if 6q is an irrational fraction. Also, for any which is an irrational fraction of vr, 
the set {9i} computed using Eq.( p.2|) is distributed uniformly between and vr. It is 
this property of the logistic map that we intend to exploit for generating uniformly 
distributed random numbers and to study its statistical behavior. 

Consider a set of numbers 

cos^^ (1 — 2xi) 
Vi = ^ (2.3) 

TT 

where, 

Xi+i = Axi{l-Xi). (2.4) 

From the discussion above, ?/j's are expected to be distributed uniformly between 
and 1, except when Xi = |, |, | or 1. For these special values one gets periodic 
series. For any other rational fraction xo we do not expect any periodicity. Thus it 
appears that, starting with a rational fraction xq we can generate a set of uniformly 
distributed numbers {yi} using Eqs( p.3| , p^) . Furthermore since a small change in 
Xq produces large deviations in Xi's (and therefore ?/i's) different initial values of Xq 
differing by small amount would produce different uncorrelated sets {Vi}- This seems 
to be a very good property for a random number generator to possess. 

There are however two difficulties to be overcome before we can construct a random 
number generator from Eqs( p.3| , p.4| ). The first is due to truncation error in computers. 
We find that, when the calculations are done in single precision, Xi becomes after 
about 5000 iterations. The exact value of i depends on xq but it is ususally ~ 5000. 
The reason for this is as follows. If Xi differs from 1/2 by a small amount e (x, = 
0.5 + e), Xi+i = 1 — and if < 10^^, Xj+i is stored as 1 in the computer. So = 
for n > 2. This difficulty is overcome by either doing the calculation in double 
precision (higher precision, if possible) or by modifying the algorithm suitably, when 
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Xi is close to 0.5. We find that this problem does not arise for upto ~ 10^ iterations 
when the calculation is done in double precision. 

The second difficulty is that of correlation among the successive ?/i's. Eq(2.2) clearly 
shows that the succesive ?/j's are correlated, although a set of ?/j's are uniformly dis- 
tributed in the interval [0,1]. The standard procedure of removing such a correlation 
is to shuffle the set of numbers obtained from logistic map. Our calculations show 
that the correlations have peculiar effect on the distribution properties and these 
persist even after shuffling (see later). On the other hand, the correlations between 
two numbers i/i and j/j+T- reduce as r is increased from 1. We call these as r-shifted 
numbers. The statistical tests have been performed on these r-shifted set with r 
ranging from 1 to 14. The following calculations have been done in double precision. 



3 Distribution Tests 

In order to test the PRN generator (logmap) described in Section 2, we have per- 
formed certain tests, for various values of r, described below. For comparison we 
have also done these tests on PRN generator RAND- available on our machine and 
RANMAR- the generator proposed by Marsaglia and Zaman. These tests are: 

1. Distribution test: here we verify Central Limit Theorem. 

2. Moments Caclulation: We have calculated (x"") and their variance an- 

3. x^"test: In the 1-d case, we have divided the interval [0, 1] into n equal bins 
and calculated ^iid its distribution. For the 2-d case, we have divided the 
region [0, 1] ® [0, 1] into n equal blocks and repeated the calculation of and 
its distribution. 

In addition, we have also performed other tests p| such as run-up test, n-tuple test 
etc. Results of these tests will not be presented here. The main reason is that the 
results of these tests are in concurrence with the above mentioned tests and it will 
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not alter our conclusions. In the following, we report the results for LOGMAP and 
RANMAR. 



3.1 Distribution Test 



Suppose that {xj} is a sequence of mutually independent random variables that are 
governed by the probability density function P{x). Then the Central Limit Theorem 
asserts that, subject to certain conditions on the moments of P{x) the variable 
Un = Y.f=i Xi, in the hmit of large N, are distributed normally: i.e., 

where = N{x) and o"^ = N{{x'^) — (x)^). 



We have obtained the distribution function of the numbers obtained from logmap for 
N = 24 for various r-shifts. In fig.(l), the soild line indicates the exact distribution 
and diamond dots indicate the distribution obtained from the logmap with r = 
1,2 and 3 calculation. From fig.(l), one can see that for r = 1 (sequential case), the 
distribution from the LOGMAP is skewed a little. Though the distribution for r = 2 
is better agreed than the sequential case, still it is far from satisfactory. For r = 3 
and greater, the distribution is in excellent agreement with the theoretical one. Thus 
this test shows that, although the successive numbers are correlated, a set of numbers 
obtained by picking every alternate or every third number, would behave as a set of 
random numbers 0. 



3.2 Moments 



For a given set of r-shifted N numbers generated from LOGMAP, we calculate n^^ 
moment (x") as 

1 ^ 

«) = ]7E^r+.- (3.1) 

i=l 



6 



Wc then repeat this test for M such sets and caclulate the global average (x") and 
variance an as 

1 M 

(K) = M^liOj (3-2) 

1 M 

^n{r) = ^EK)?-(^r) (3-3) 

We have varied r from 1 to 14 for two different values of A^-1024 and 8096 and M 
is held fixed at 4000. For random numbers uniformly distributed between and 1 
(x")* = and (7^'^ = (2„_|_i)(^„_|_i)^ • The results for the PRN generator based on 
logistic map are shown in Table 1. 

An inspection of Table 1 shows that (x") agrees well with the theoretical values for 
different n's and r's. The agreement is comparable with the results of RANMAR. 
But cr„(r) departs significantly from the theoretical value for r = 1, and the depar- 
ture systematically increases with the increase in n. The variance is calculated by 
subtracting two almost equal numbers, so some loss of accuracy is expected. But 
the departure is much larger. On the other hand, for r larger than 4, the calculated 
variances are close to the theoretical values and are comparable with those obtained 
from RANMAR. 



3.3 x^-Test 

In 1-D test, the interval [0,1] is divided into n equal parts and the numbers falling 
in i*'* interval, out of a set of N numbers is calculated. For a uniform distribution, 
is defined by, 

^" N/n ^ ^ 

The test is repeated for M such sets and the distribution of xi obtained. The- 
oretically, the thus obtained should have a distribution for n — 1 degrees of 
freedom: 
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The calculation is done for N = 10240, M = 4000, n is varied from 2 to 256 and r is 
varied from 1 to 14. The results for n = 4, 64 and 256 are displayed in figs(2a,2b and 
2c). The corresponding Xn RANMAR is also given for comparison. 

Consider r = 1 or the sequential case first. The calculated distribution differs 
significantly from the theoretical one. This is somewhat surprising, since the set of 
numbers obtained from the logistic map are are uniformly distributed. We have con- 
firmed that shuffling does not mitigate this problem. Thus the correlations between 
successive numbers are probably responsible for this behavior. 

As T is increased, the agreement between calculated and theoretical distributions 
improves. Prom fig. (2a), we find that for n = 4, r = 2 distribution already agrees 
reasonably with the theoretical distribution. On the other hand, for n — 256, one 
gets a good agreement for r ~ 6 or larger. This clearly shows that, in order to have 
'randomness' on a finer scale (corresponding to smaller bin size, or larger n), the 
r-shift must be larger. 

For 2-D x^-test, we choose a pair of succesive numbers (x and y coordinates) and 
determine how they are distributed in n equal-area blocks covering a square of unit 
side. The distributions are calculated as discussed in the 1-D case and the results 
for n = 4, 64 and 256 are presented in figs(3a, 3b and 3c). These results follow the 
same pattern as that of 1-D case with one difference. For t — 1, the calculated 
distribution is nowhere near the theoretical distribution for n > A. This is simply 
because the succesive numbers are highly correlated. This can be explained as follows. 
For the sequential case, when the unit square is divided into four equal blocks, each 
block would have some number of points. Hence for n — A, one does not expect any 
abnormal behavior in the x^ distribution. However, this is not the situation when 
the unit area is divided into more than 2x2 blocks; it would so happen that some of 
the blocks do not contain any points at all. The contribution of these blocks to the 
X^ (see eq.(3.4)) would be N/n, thus pushing the value of x^ higher. However, with 
a T-shift of 4 or larger, these correlations are more or less wiped out and one gets 
reasonably good agreement between calculated and theoretical distributions. 
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4 Summary and Conclusions 



Various tests on the series of numbers obtained from the logmap have been performed 
in this work. These tests bring out certain pecuharities which have not been noted 
before. We notice that, 

1. the distribution test does not satisfy the central hmit theorem for r = 1. The 
correlations between succesive numbers, which are known to exist, are probably 
responsible for this. But for r > 2, the agreement with the central limit theorem 
is excellent. 

2. The moments calculation comfirms the result of the distribution test. That is 
the calculated moments agree with their theoretical values, even for r = 1. The 
variances of these moments disagree with their theoretical values for r = 1. 
However, for r > 4, the moments as well as variances agree with the theoretical 
values. This shows that the correlations between succesive numbers play a subtle 
role in moments calculation and removal of these correlations (by r-shifting) is 
essential. 

3. the x^-tests show that, for r = 1 the ^^-distributions systematically differ from 
the theoretical distribution. The agreement between calculated and theoreti- 
cal distributions is improved by increasing r. Thus, one must use r-shifted 
numbers (with larger value of r for smaller interval-size) to obtain acceptable 
X^-distribution. 

As far, computer time, the LOGMAP is about three times slower than the RANMAR 
However, in the calculations where prn's are used, a relatively small fraction of time is 
spent in generating random numbers. Therefore the relative slowness of the LOGMAP 
is not a big handicap. Furthermore, independent sets of random prn's can be easily 
obtained from the LOGMAP by choosing different values of seeds Xq's. This is so 
because two seeds differing by a small amount would generate entirely different series 
after a few iterations. This property makes the LOGMAP easily adaptable to parallel 
processing. 
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To conclude, we note that r-shifted LOGMAP satisfies some of tlie elementary tests a 
pseudorandom number generator must pass. The LOGMAP being based on a physically 
chaotic process, the calculations where randomness, as opposed to computer time is 
important, it is advantageous to use LOGMAP. 
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Figure Captions 



fig.(l) Central Limit Theorem Verification for 24 numbers. The solid curve is obtained 
from the exact distribution Pniy) = ^2nNa^ ^"^P ~ ^^2Na^ ' where N — 24, 
/i = 1/2 and cr^ = 1/12. The diamonds with dot are obtained from the logmap with 
r = 1, 2 and 3 (as indicated by the label). 

fig. (2) The distribution of the LOGMAP for the 1— dimension case. The smooth 
curve is obtained from the exact distribution, eqn.(3.5) and the histogram is obtained 
from the LOGMAP and the RANMAR, labelled appropriately, a) for 4 bins, b) for 64 
bins and c) for 256 bins. 

fig. (3) The distribution of the LOGMAP for the 2— dimension case. The smooth 
curve is obtained from the exact distribution, eqn.(3.5) and the histogram is obtained 
from the LOGMAP and the RANMAR, labelled appropriately, a) for 4 bins, b) for 64 
bins and c) for 256 bins. 
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n 


r = 1 


T = 3 


r = 6 


RANMAR 


exact 


1 


{x) 


.50010 


.50006 


.50000 


.49993 


.50000 




0"! 


.08400 


.08460 


.08626 


.08155 


.08333 


2 




.33340 


.33341 


.33332 


.33324 


.33333 






.03736 


.08784 


.09102 


.08736 


.08889 


3 




.25004 


.25007 


.24999 


.24991 


.25000 






.01392 


.07828 


.08111 


.07964 


.08036 


4 




.20002 


.20006 


.20000 


.19992 


.20000 




0-4 


.00518 


.06839 


.07092 


.07086 


.07111 


5 




.16667 


.16672 


.16667 


.16660 


.16667 




0-5 


.00273 


.06002 


.06240 


.06305 


.06313 


6 




.14285 


.14291 


.14286 


.14280 


.14286 




0-6 


.00276 


.05318 


.05550 


.05642 


.05651 


7 




.12498 


.12505 


.12500 


.12495 


.12500 




0-7 


.00363 


.04759 


.04990 


.05087 


.05104 


8 




.11109 


.11115 


.11112 


.11107 


.11111 




0"8 


.00469 


.04298 


.04528 


.04620 


.04648 


9 




.09997 


.10004 


.10001 


.09997 


.10000 






.00567 


.03913 


.04143 


.04225 


.04263 



Table 1: (x") and a,, for LOGMAP with r = 1, 3 and 6 and RANMAR for 
n = 1, 2, • • • , 10. N = 8196 and M = 4000. 
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